Statistical Evaluation of Pronunciation Encoding
Abstract
In this study we investigate whether newly created pronunciation encodings can be automatically evaluated as correct or as containing a potential error. Using a cascaded triphone detector and phonotactic n-gram modeling with an optimal Bayesian threshold, we classify unknown pronunciation transcripts as either 'probably faulty' or 'probably correct'. Transcripts tagged 'probably faulty' are forwarded to an expert for manual inspection, while encodings tagged 'probably correct' are passed without further inspection. An evaluation of the new method on the German PHONOLEX lexical resource shows that, with a tolerable error margin of approximately 3% faulty transcriptions, the work effort required to produce a new lexical resource can be reduced substantially.
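The n-gram part of this idea can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a phone bigram model with add-one smoothing, omits the cascaded triphone detector, and uses a hand-picked threshold in place of the optimal Bayesian threshold described above. The function names, the toy transcripts, and the threshold value are all hypothetical.

```python
import math
from collections import Counter


def train_bigram(transcripts):
    """Count phone bigrams (with boundary markers) over trusted transcripts."""
    bigrams, contexts, phones = Counter(), Counter(), set()
    for transcript in transcripts:
        seq = ["<s>"] + list(transcript) + ["</s>"]
        phones.update(seq)
        for a, b in zip(seq, seq[1:]):
            bigrams[(a, b)] += 1
            contexts[a] += 1
    return bigrams, contexts, len(phones)


def avg_logprob(transcript, model):
    """Average add-one-smoothed bigram log-probability per transition."""
    bigrams, contexts, vocab_size = model
    seq = ["<s>"] + list(transcript) + ["</s>"]
    total = 0.0
    for a, b in zip(seq, seq[1:]):
        total += math.log((bigrams[(a, b)] + 1) / (contexts[a] + vocab_size))
    return total / (len(seq) - 1)


def classify(transcript, model, threshold):
    """Tag a transcript by comparing its score with a fixed threshold."""
    score = avg_logprob(transcript, model)
    return "probably correct" if score >= threshold else "probably faulty"


# Toy data (hypothetical): each transcript is a list of phone symbols.
trusted = [
    ["h", "a", "l", "o"],
    ["h", "a", "n", "t"],
    ["l", "a", "n", "t"],
    ["t", "a", "l"],
]
model = train_bigram(trusted)
THRESHOLD = -1.8  # assumed value, chosen by hand for this toy example

print(classify(["h", "a", "l"], model, THRESHOLD))
print(classify(["l", "h", "t", "a"], model, THRESHOLD))
```

In a real setting the threshold would be estimated from the score distributions of known correct and known faulty transcripts rather than fixed by hand, and only the transcripts tagged 'probably faulty' would go to the expert.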
Similar resources
The Impact of Computer–Assisted Language Learning (CALL) /Web-Based Instruction on Improving EFL Learners’ Pronunciation Ability
The purpose of this study was to investigate the effect of CALL/Web-based instruction on improving EFL learners’ pronunciation ability. To this end, 85 students who were enrolled in a language institute in Rasht were selected as subjects. These students were given the Oxford Placement Test in order to validate their proficiency levels. They were then divided into two groups of 30 and were...
Japanese Pronunciation Prediction as Phrasal Statistical Machine Translation
This paper addresses the problem of predicting the pronunciation of Japanese text. The difficulty of this task lies in the high degree of ambiguity in the pronunciation of Japanese characters and words. Previous approaches have either considered the task as a word-level classification problem based on a dictionary, which does not fare well in handling out-of-vocabulary (OOV) words; or solely fo...
Statistical Modelling of Pronunciation: It's Not the Model, It's the Data
In this paper we describe a method to model pronunciation for ASR in the German VERBMOBIL task. Our findings suggest that a simple model, i.e. pronunciation variants modelled by SAMPA units and weighted with a-posteriori probabilities, can be used successfully for ASR, if there is a sufficient amount of reliably transcribed speech data available. Manual segmentation and labelling of speech (especi...
New Feature Parameters for Pronunciation Evaluation in English Presentations at International Conferences
We have previously proposed a statistical method for estimating the pronunciation proficiency and intelligibility of presentations made in English by non-native speakers. To investigate the relationship between various acoustic measures and the pronunciation score and intelligibility, we statistically analyzed the speaker’s actual utterances to find combinations of acoustic features with a high...
A statistical method of evaluating the pronunciation proficiency/intelligibility of English presentations by Japanese speakers
In this paper, we propose a statistical evaluation method of pronunciation proficiency and intelligibility for presentations made in English by native Japanese speakers. We statistically analyzed the actual utterances of speakers to find combinations of acoustic and linguistic features with high correlation between the scores estimated by the system and native English teachers. Our results show...